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Abstract 


A wide variety of modern chess software products is available to the modern professional and amateur chess 
players alike, helping them improve their chess skills and prepare for online and traditional tournaments. These 
products include chess user interfaces (UIs), traditional Alpha-Beta (AB) and emergent Neural Network (NN) 
chess engines, game databases, opening databases and electronic books, chess-specific cloud services, tourna- 
ment broadcast tools, online tutorials, tactical problem collections, and endgame tablebases (EGTBs). All of 
these tools except the last two categories can be used to work on opening preparation, an important component 
of chess training. In this paper, the author presents his computer-based approach to opening preparation tested 
in chess classes at the Russian School of Indiana for advanced beginner players. The materials used to develop 
the approach included game openings from the games played in the Free Open-Source Chess Engine Contest 
(FOSCEC) broadcast online by the author’s CIT students at Purdue Polytechnic Columbus. We will discuss the 
choices of tools and equipment, how the more popular and/or promising opening variations were identified and 
analyzed, the lessons learned, and the future work. 


INTRODUCTION 


Chess is a classical turn-based strategy game played on an 8x8 physical or virtual board with white and black 
pieces (pawns, knights, bishops, rooks, queens, and kings). The game enjoys broad popularity worldwide, as its 
rules are pretty easy to learn, the design is well-balanced, and the gameplay can be a lot of fun. However, the 
task of mastering the deeper intricacies of chess presents a formidable challenge. Many software applications 
have been developed to implement chess as a video game on numerous electronic platforms and help millions 
of novices, seasoned amateurs, and chess professionals improve and maintain their skills. These applications 
include the following. 


1. Chess engines capable of analyzing the game’s positions and playing the game. They utilize variations 
of the traditional Alpha-Beta (AB) minimax algorithm, emergent techniques based on Neural Networks 
(NN), and, most recently (since 2020), extremely successful hybrids thereof, such as Stockfish 12+ with 
NNUE (Efficiently Updatable Neural Network) for position evaluation (Stockfish, 2021). 


2. Game databases, such as Mega Database (2020) of human over-the-board (OTB) games, Tim Harding’s 
UltraCorr 2021 database of correspondence games (UtraCorr, 2021), and CCRL’s database of games 
played by computer chess engines (CCRL, 2021). 


3. Chess user interfaces (UIs) that facilitate communication between the user and the engine(s) and/or da- 
tabases. They include both commercial products, such as ChessBase 16 (2021), and free solutions, such 
as Arena (2021) or Shane's Chess Information Database (SCID, 2021). 
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4. Chess tournament broadcast tools, such as Norman Schmidt’s CCCC (2018). 


5. Opening databases and electronic books for chess engines to use early in the game and for human play- 
ers to explore and learn chess openings. They include the Cerebellum opening book for Brainfish by 
Thomas Zipproth (Cerebellum, 2021) and Fauzi Dabat’s opening book (Fauzi, 2021). 


6. Chess-specific cloud services, such as the ChessBase Engine Cloud (2021). 
7. Tactical problem/puzzle collections, such as those available at chess.com (2021) and lichess.org (2021). 


8. Endgame tablebases (EGTBs), such as Ronald de Man’s Syzygy (2021) and the pioneering 7-piece Lo- 
monosov tablebases (2012) calculated at the Computer Science Department of Moscow State Universi- 


ty. 


A chess game is traditionally divided into three stages: opening, where a lot of attention is paid to piece devel- 
opment and control over the center of the board; the middlegame, with its tactics and strategies of attack and 
defense; and the endgame, where few pieces are left on the board, so the kings become active and pawn promo- 
tion gains utmost importance. All of the software tools listed above except the last two categories can be used to 
work on opening preparation, an important component of chess training. 


In this paper, the author, an International Chess Federation (FIDE) National Instructor, presents his computer- 
based approach to opening preparation tested in chess classes taught online at the Russian School of Indiana 
(2021). The materials used to develop the approach include openings from the computer chess games played in 
the Free Open-Source Chess Engine Contest (FOSCEC, 2014) broadcast online by the author’s CIT undergrad- 
uate students at Purdue Polytechnic Columbus as part of the chess-related projects described in detail in the au- 
thor’s previous work (Gusev, 2018). 


In the next sections of the paper, we will discuss the choices of tools and equipment, how the more popular 
and/or promising opening variations were identified, ordered, illustrated, and analyzed. We will then present our 
conclusions and discuss plans for the future work. 


TOOLS AND EQUIPMENT 


Even though many modern chess engines have been ported to other platforms, such as Linux and Android (Ab- 
shire and Gusev, 2015), the author used Windows laptops, desktop workstations, and servers for this project to 
take advantage of the convenient chess GUI tools — ChessBase 13, Deep Fritz 14 (2014), and Arena 3.5.1. The 
author enhanced the truly massive Computer Chess Rating Lists (CCRL) database of 3,172,504 games played in 
2005-2019 by adding 215,485 engine games from numerous other sources, including 4 seasons of FOSCEC, 44 
themed opening tournaments ran by the author in 2012-2019, the first 15 seasons of the Top Chess Engine 
Championship (TCEC) (2021), the Chess Engines Grand Tournament (CEGT) archive, 8 Computer Chess 
Championship (CCC) events (CCC, 2019), the FastGM (2021) archive, World Computer Chess Championship 
(WCCC) and World Chess Solving Championship (WCSC) events held by the International Computer Games 
Association (ICGA) (2021), the AlphaZero vs. Stockfish 8 match, Frank Quisinsky’s FEOBOS project (2018), 
etc. This tool dubbed CCRL+ was then used, along with Mega Database 2019 (7,519,541 Over the Board 
(OTB) games)) and the engine evaluations from the ChessBase Cloud, combined with the local Cfish evalua- 
tions (Cfish, 2017) to identify the more popular and/or promising variations to bring to the attention of the au- 
thor’s advanced beginner chess students. This part of the work was completed using ChessBase 13. 
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For the purpose of illustration, the popular variations were extended using Brainfish version from February 8, 
2019 configured to use its Cerebellum opening book under Arena 3.5.1. The details of the process will be ex- 
plained in the subsequent sections. 


The rationale for tool and equipment selection, besides the obvious availability considerations, involved the re- 
alization that the current situation is opposite to what the author experienced back in the 1980s, when it was 
hard to find comprehensive information on chess openings. We have too much data! No person can view, much 
less analyze, millions of games. Some of the modern Big Data is of much better quality than what was available 
in the old days, while some other information is just as unreliable as it used to be. We have selected tools and 
equipment suitable for quick massive statistical processing of the chess game data. 


IDENTIFICATION OF POPULAR VARIATIONS 


The two big categories of what the students of chess should learn are what to do and what not to do in the open- 
ings. We will concentrate on the former aspect, leaving the latter to the authors of books on opening traps and 
catastrophes, such as (Wall, 2010). That other kind of books has great entertainment value, but should not be 
substituted for serious opening research of more practical value. 


Initially, we aimed at selecting 1,023 popular and/or promising variations, this “magic number” being the limit 
of how many games Arena 3.5.1 can play in one match. Our rule of thumb was, therefore, to stop splitting Mega 
Database 2019 variations once they were down to approximately 7,519,541:1,023 = 7,350 games per variation. 
We took into consideration the percentages of points won by the white and provided by ChessBase, along with 
the cloud and local engine evals. The Principal Variation (PV) is a sequence of moves that an engine considers 
best and therefore expects to be played. The number of PVs to be analyzed locally in a given position remained 
set at 5 most of the time, so as to explore at most 5 possibilities at a time. 


As expected, the variations would not split evenly on popularity. Furthermore, even as we kept track of how 
many variations we expected to pick starting from each position, we kept encountering “promising” continua- 
tions, which, while not being popular, showed good percentages and good computer evals. Once we were down 
to four variations to pick starting from a given position, we were able to complete selection directly, with a little 
bit of effort. 


One major nuisance that made our recursive process more complex was that, from time to time, we would 
stumble upon a popular transposition of moves leading to the same position that we had encountered or were 
about to encounter elsewhere in the search tree. (Not all move transpositions are legal, and not every legal 
transposition of moves is safe.) We kept track of such popular transpositions and added their contributions to 
the corresponding positions to allow more branching to happen afterwards, according to the total combined 
popularity of the position. 


With those practical considerations in place, the manual selection process produced a set of 1,442 popular 
and/or promising variations and 167 popular transpositions. Given that the first move 1. e4 occurs in 50.9% 
(~1/2) of the OTB games, and the move second to that in popularity, 1. d4, happens in 31.9% (~1/3) of the 
games, the variations were divided nearly uniformly into six volumes: 


Volume 1. Sicilian Defense (1. e4 c5) — 304 variations (21.1% of all selected variations) to cover 
20.4 % of all OTB games in Mega Database 2019. 
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Volume 2. Open Game (1. e4 e5) — 219 variations (15.2% of variations) to cover 12.4% of the 
games. 


Volume 3. Non-Sicilian Semi-Open Games: |. e4 e6 (French Defense), 1. e4 c6 (Caro-Kann De- 
fense), 1. e4 d6 (Pirc Defense), 1. e4 d5 (Scandinavian Defense), 1. e4 g6 (Modern Defense), 1. 
e4 Nf6 (Alekhine Defense), 1. e4 Nc6 (Nimzowitsch Defense), 1. e4 b6 (Owen Defense), and 1. 
e4 a6 (St. George Defense) — 254 variations (17.6% of variations) to cover 18.0% of the 
games. 


Volume 4. Indian Defense (1. d4 Nf6) — 249 variations (17.3% of variations) to cover 17.0% of the 
games. 


Volume 5. Non-Indian responses to Queen’s Pawn Game (1. d4 followed by moves other than 1... 
Nf6) — 238 variations (16.5% of variations) to cover 14.8% of the games. 


Volume 6. Openings that do not begin with 1. e4 or 1. d4: 1. Nf3 (Reti Opening), 1. c4 (English 
Opening), 1. f4 (Bird Opening), 1. g3 (Benko Opening), etc. — 178 variations (12.3% of varia- 
tions) to cover 17.2% of the games. (~0.2% of the games “fell through the cracks”, due to their 
very unusual openings. This number will grow considerably, once we take into consideration 
unusual continuations ignored later in the tree search.) 


ORDERING THE POPULAR VARIATIONS 


Many traditional books on chess openings, including the famous Encyclopedia of Chess Openings (ECO, 2000- 
2008), begin their discourse with rare variations to proceed gradually to the more common ones. Even as we 
stored the ECO codes assigned automatically by ChessBase and Arena, we have opted for the lexicographic or- 
dering based on our hex line codes that capture priorities of the variations according to their Mega Database 
popularity, with some exceptions made for promising variations. Figure 1 shows a fragment of an Excel work- 
sheet illustrating our greedy Bottom Line Up Front (BLUF) approach, where the “bottom line” is the first line, 
in the conventional chess terminology. 


For example, Line 300 (not seen in the figure) has the hex line code of 11C, where the hexadecimal digit C cor- 
responds to the decimal number 12. The corresponding variation is 1. e4 c5 2. b4 (B20 Sicilian: Wing Gambit). 
Indeed, 1... cS is the most popular response to 1. e4, and 2. b4 is the 12" most popular reply to that, after 2. Nf3 
(1), 2. Ne3 (2), 2. c3 (3), 2. d4 (4), 2. f4 (5), 2. d3 (6), 2. c4 (7), 2. b3 (8), 2. Bc4 (9), 2. Ne2 (A), 2. g3 (B), and 
before 2. a3 (D). Even though we do not envision the need to consider more than 15 continuations in a practical 
opening position, we could use subsequent letters of the English alphabet after F (the largest hexadecimal digit 
corresponding to the decimal 15) if we needed to — G for the hypothetical 16™ choice, H for the 17" choice, 
and so on, thus trivially extending our approach. 


Notice that Line | in Figure 1 corresponds to the ECO code of B98 (Sicilian: Najdorf, 7.f4 Be7), which means 
that the information on this line would be found near the end of the second volume (Volume B) of the Encyclo- 
pedia of Chess Openings. This line was played in 12,360 Mega Database 2020 OTB games (0.154%), 8,580 UI- 
traCorr 2021 correspondence games (0.379%), and 3,196 CCRL+ engine games (0.094%). 
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ary 1. Lexicographic ordering of the selected popular variations by hex line codes. 
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For comparison, Line 300 (Wing Gambit) has occurred in 5,306 Mega Database 2020 games (0.082%), 1,528 
UltraCorr games (0.068%), and 1,268 CCRL+ games (0.037%). Notice that the end positions of Line | and Line 
300 are not at the same tree depth relative to the classical starting position. The term ply in chess denotes half of 
a move. Line | goes 14 plies deep, while Line 300 is only 3 plies deep. Arena screenshots of the end positions 
of these two selected popular variations are shown in Figure 2. 


1.4 5 Is 2. Nf3 1s d6 3. d4 cxd4 4. Nxd4 1s NI6 Is 5. Nc3 1s a6 
1s 6. Bg5 2s €6 12s 7. £4 Be? 2s 


1.4 c5 1s 2. bd 1s 


Ad i 
Demo Analyze Edit 


RIPAS 


b4 
Time: 00:01.875 


lise Sicilian: Wing Gambit, 1.e4 c5 2.b4 b2b4 


Figure 2. End positions of 2 selected popular variations — Line 1 (top) and Line 300 (bottom). 


Our job is not done here, because we haven’t given our chess students an idea how the events may unfold in 
each of the selected popular and/or promising variations. Clearly, no advanced beginner is going to study thou- 
sands (or hundreds) of games per variation, and neither should they attempt it. Our approach to extending the 
selected popular variations using Brainfish with Cerebellum will be explained in the next section of the paper. 
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ILLUSTRATING THE POPULAR VARIATIONS 


We ran Arena 3.5.1 matches of Brainfish version from February 8, 2019, configured with its Cerebellum open- 
ing book playing blitz games of chess against itself to extract the first lines for the previously selected 1,442 
popular and/or promising variations and 167 popular transpositions. Brainfish was configured to always play 
the best move from its opening book until it ran out of the book moves. The way Arena records the games has 
allowed us to distinguish Cerebellum’s first line continuation of the selected popular variation from how the 
middlegame and endgame stages subsequently played out under the short time control of 3 minutes per game 
plus 2 seconds of time increment per move (3' +2" in the conventional chess notation). 


Figure 3 shows a position after Move 18 and chess notation for Game 111 of Volume 1 of the resulting First 
Lines collection. The game illustrates Line 111 of the selected popular variations set, our hex line code 
1112111111111111111111, ECO code B33, Sicilian: Pelikan, Chelyabinsk, 9. Nd5 Be7, 11. c3 O-O. 


ro] 

¥ 
Date ° 
+ tons Previous Game © Forward 
» Lona Nest Game p View Game Miatery 
Omabase Game Haters 
Notation = Openings Book 7a 
Notation Raterence Table Tring Score sheet Uveliock Openings Book 


+25 104 cS 2.Nf3 No6 3.d4 cod4 4.Nud4 NIG S.Nc3 e&S 6.NdbS dé 7.8g5 26 8.Na3 bS 9. 10.86 Bxf6 11.3 0-0 
Vol 1, Game 111 of 304 12.Nc2 RbB 13.h4 g6 14.93 Bg? 15.hS Bed 16.Nce3 Ne7 17.23 NadS 18. Neos EB o 14/26 7 (Qd8-d Qd1-d2 
Rf8-cB hSing6 H7xg6 Rat-d1 ReB-cS NdS-e3 Rb8-d8 Bf1-d3 d6-d5 e4xdS BesxdS Bd3-<2 BdS<6 Qd2nd7 RdBxd7 Rdl xd? BeGxd7 Ke1-e2 Kg8-t8 
Rht-d1 Bd?-c6 Bc2-b3 K/8-e7 Ne3-d5+ BcGxdS Bb3ud5 a6-a5 g3-g4 a5-a4 Ke2-e3 17-15 g4-g5 ReS-<B Rd1-h1) 7 19.Qd3 0.41/27 22 

(Qd1-d3 Rf8-cB NdS-e3 RcB-<S Ral-d1 RbS-d8 BF -e2 Bes-b3 Rd1-d2 Qd7-e7 Be2-d1 Bb3-c4 Ne3xc$ bSxc4 Qd3-e3 d6-d5 hSxg6 h7xg6 edad 
ReSxdS Rd2xdS RdSxdS Bd1-f3 e5-e4 Rht-h4 RdS-e5 Rhdned (7-15 RedneS Qe7xe5 Qe3ne5) 19_RKB 0.00/26 6 (RIS-<B NdS-e3 ReB-c5 Rat-d1 
Rb8-d8 hSxg {7x96 Ne3-d5 h7-hS Bf1-g2 Kg8-h7 O-O Rd8-18 Nd5-b4 RIB-dB) 6 20.Rd1 0.26/26 14 (Rat-d1 ReB-c5 hSxg6 h7xg6 Nd5-e3 Rb8- 
G8 Qd3-2 Kg8-f8 f2-f3 RcS-cB Qc2-d2 a6-aS Kel -f2 Qd7-a7 Ki2-g2 Be6-b3 Rdt-ct dé-dS etudS Qa?-cS Qd2-f2 a5 3-04 QcS-d4 Rht-h2 
bSnod Rh2-h1) 20_RcS 0.10/29-4 (RcB-c5 hSxg6 h7xg6 NdS-e3 RbS-d8 Bft-e2 KgS-f8 Qd3-b1 Bg7-f6 Qb1-d3 Qd7-e7 Ket-f1 Be6-b3 Kf1-g2 
Qe7-b7 Rd1-e1 d6-d5 Ne3ndS Bb3md5 exdS Qb7xd5 + ReSxd5 ¢3-c4 Rd5-d2 b2-bé bSxct Be2xo4 RdB-d6 Rh1-h7) 4 21.hxg6 0.34/27 0 
(hSug6 h7xg6 NdS-e3 RbS-d8 Qd3-c2 a6-aS 8f1-d3 Qd7-c? Qc2-d2 Qc? -b7 Qd2-e2 RdS-bS Rh1-h4 6g? 6 Rh4-ht Kg8-g7 Qe2 #3 Bf gS 8d3 
2 ReS-cB Rdt-d2 RcB-hS RhtxhS RbGxhS Rd2xd6) +.34/27 O 21...hugs 0.00/28 & (h7xg6 NdS-e3 Rb-dS Bfl-e2 KgS-f8 Ket -f1 f7-fS etafS g6xfS 
Rh1-h5 €5-e4 Qd3-d2 Be6-{7 RhS-h7 BI7-g8 Rh7-h5) 8 22.Ne3 0.20/29 0 (Nd5-€3 RbB-d8 Qd3-c2 a6-a5 Bf1-d3 Qd7-c7 Qc2-d2 KgB-t8 
Rh1-h7 Qc7-b7 Qd2-e2 Rc5-cB Bd3-b1 Kf8-g8 Rh7-h2 b5-b4 a3xb4 aSxb4 c3-c4 Bg7-H6 Ket-f1 KgB-g7 Kft-g2 Rd8-h8 Rdt-h1 Bi6-95 Ne3-d5 
Rh8xh2+ Rhixh2 b4-b3 Bb1-d3 BeSxdS e4xd5) +.20/29 0 22...RdB 0.00/29 10 (Rb6-d8 Bfl-e2 Bg7-f6 Kel-f1 Be6-b3 Ne3-g4 BI6-g7 Ng4-e3) 
23.Qe2 0.08/30 25 (Qd3-c2 a6-a5 Bt1-d3 Qd7-c7 Qc2-e2 d5-d5 edxdS BebudS Ne3ud5 ReSudS O-O bS-b4 c3xb$ aSub4 Bd3-e4 RoSudt Rétxdt 
RdBxd1 + Qe2xd1 béxa3 b2xa3 Bg7-18 a3-a4 Kg8-g) £7-45 Bet-d5 BIB-<S Qd'1-b3 Bc5-b6 Qb3-bS Bb6-d4 BdS-o4 Bd4-b6 Bot-b3 Qc7-c5 
QbSxcS) 23.05 0.29/27 11 (a6-aS Bf -d3 8g? 46 Qc2-e2 Qd?-b7 Qe2-f3 Kg8-g? Kel-e2 8% gS Rht-h2 Rd8-h8 Rh2xhs Kg?xhs Rd1-ht+ Khs. 
8 Ke2-f1 Kg8-g? RcS-<8 QF3-e2 bS-b4 a3xb4 aSxb4 c3-04 RcB-hS Rh1xhS Kg7xhS Ne3-d5 Kh8-g7) 24.8d3 0.06/30 (Bf -d3 Qd7-c7? Bd3 
€2 d6-d5 edad5 BeGad5 Ne3ndS ReSxd5 O-O b5-b4 RdixdS RdaxdS adxb4 aSab4 Qc2-e4 RdS-d2 Be2-c4 banc b2xc3 Cc7-b6 Qe4-13 Qb6-f6 
Qf3x16 Bg7x46 RI1-a1 Rd2-d7 Bed-bS Rd7-e7 Rat-e1 €5-e4 c3-c4 BI6-g5 Kgt-f1 e4-€3 f2xe3 BgSxe3) 24..Bf6 0.23/28 7 (Bg-16 Ke1-f1 Bf6-g5 
Ne3-dS BeéudS edu B-g? Kf1-g2 RcS-<B Qc2-b3 RcB-bs Rh1-h2 Qd7-g4 Qb3-c2 RdB-hS Rh2-h1 Qg4-d7 RhixhS RbSxhS Rd1-h1 RhBxht 
Kg2xht Qd7-g4 Bd3xb5 Qg4-f3+ Kht-g? Qf3xd5 BbS-f1 e5-e4 Qc2-a4 BgS-f6 Qad-b5 QdS-d2) 7 25.Qe2 0.00/26 19 (Qc2-e2 Qd7-b7 Rht-h2 
46-d5 e4ndS BeGndS Ne3-g4 816-97 Bd3-e4 Qb7-cB RdtxdS ReSndS BedudS RdndS Ngt-e3 RdS-c5 Ket-f1 Qc8-d7 Rh2-ht Qd7-<6 Rh1-h2) 
25...QbT 0.31/28 33 (Qd7-b7 Qe2-f3 Qb7-e7 Kel-f1 d6-d5 edxds BeGxdS Bd3-e4 Bd5-b3 Ne3-d5 Bb3xdS RdtxdS RcSudS BedudS e5-e4 Of3xe4 
Qe7inet BdSaet RAS-d2 b2-b4 Bi6xc3 b4xaS Bc3a5 Kf1-g2 BaS-b6 RhT-b1 Bb6xf2 RbTxbS Bf2-cS+ Kg2-f1 Bc5-d6 RbS-dS Rd2xdS Beduds 
26,Rh2 0.00/32 0 (Rh1-h2 d6-d5 edxdS BeGxdS Ne3-g4 BI6-g7 BA3-e4 Qb7-e7 BetxdS ReSxdS RdtxdS RdSxdS No4-e3 RdS-c5 Ket-f1 Qe7-<8 
Qe2-f3 Re5-c6 KI1-g2 Re6-d6 Rh2-h4 Rd6-d2 b2-b4 aSxb4 Rh4xb4 Bg7-h6 Ne3-g4 Bh6-g7) .00/32 0 26..Kg7 0.17/28 11(Kg8-g7 Qe2-d2 RdB 
h8 Rh2xh8 Kg7xhS Kel-f1 Kh8-g8 Kf1-g2 Bf6-g5 Bd3-c2 bS-b4 a3xb4 aSxbs Qd2xd6 BgSne3 (2xe3 Qb7-c7 Qdbxc? ReSxc? Rd1-d3 Kg8-g? 8c2 


a4 RcT-b7 b2-b3 b4xc3 Rd3xc3 Rb7-b4 Ba4-c6 BeGxb3 Rc3-cS Bb3-e6 Kg2-f2 Rb4-b3 Beb-d5 Rb3-b2+ K2-13) 27.Qd2 0.00/32 6 (Qe2-d2 Qb7 
€7 Qd2-e2 Qe7-b7) 27.KIB 0.45/28 4 (Kg7-18 12-13 KIB-gB Qd2-f2 b5-b alxb4 aSxb4 c3-c4 Re5-a5 Kel-f1 RdS-a8 Bd3-b1 Qb7-b6 b2-b3 Rad 
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33. Stotare Pemtan and Sresteace Vanationt 


Figure 3. Screenshot of Game 111 of the First Lines collection viewed in ChessBase 13. 


You can observe that Brainfish has extended the 11-move (22-ply) popular variation with its 13-ply Cerebellum 
first line. After that, you see a series of blitz game moves, starting with the 18" move of the black, 18... Qd7, 
complete with its computer evaluation of -0.14 at a modest depth of 26 plies and the expected PV. The game 
result was a draw, owing both to the nearly even evaluation of the end position of the popular variation (0.26 at 
Depth 49 by Stockfish 13) and to the exactly even strength of the computer opponents (Brainfish playing 
against itself). Interestingly enough, not a single game from Mega Database 2020, UltraCorr 2021, or CCRL+ 
has reached the position after Move 18. 17. a3 appears to be a novelty. 


We decided to look at the win/draw/loss statistics of the 1,442 main games of the First Lines collection grouped 
into six Volumes. The stats by Volume are illustrated in Figure 4. 
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‘Statistics: FiestLines¥olt_SicilianDetense X | Stabstics: firstLinesVol2_OpenGame x 
Total Games: 608 Total Games: 438 
1-0: 23Games *76% Totak 52.1% 1-0: 23 Games =105% Totat 543% 
Yes 271 Games = 00.1% Yeh 192 Games = 07.7% 
0-1 10 Games =33% Totat 47.9% O-1 = 4Games =18% Total 45.7% 


White - Rated games 34 Elo-O 2200 = Performance =2215 
Black =. Rated games. 304 Elo. 2200 Performance: #2185 


White - Rated games 219 0-0 2200 Performance =2230 
Black .Raled games 219 Glo. 2200 Performance: #2170 


NaGO8 @tewtt = OLength Overs Okco-A Oko Otcot Okco-D Otcot Otcowt Nels @iess = Olength Overs Oto Ofco® Okot OfcoD Okot Okoat 
aK z| > . Player Own = Olver @roth mo] fe] [> : Player Olinee Opek @eeeh 
‘Statistics: FiestLines¥olS_SemiOpentctSicitan: X | Sabstice FirstLinesVols_IndianDefense x 


Total Games: 508 Total Games: 498 

1-0: 31 Games =122% Totak 55.1% 1-0: 45 Games =18.1% Total 57.6% 

Yes: 218 Games = 85.8% Yee 197 Games = 79.1% 

0-1 5 Games =20% Totat 44.9% or 7 Games =28% Total 424% 

White - Rated games 254 Elo-O 2200 Performance =2236 ‘White - Rated games 249 G0-@ 2200 = Performance =2253 
Black =. Rated games 254 Elo-O 2200 Performance: #2164 Black -Raled games 249 Elo.O 2200 Performance: #2147 


NaS08 @ftewte = Otength Overs OkcoA Oko Ofcot Okco-D Okcot Okcovt Ne a3t @fexm — Olength Oven 


Player OWhae Ole @roth x] fe] i> - 


Oto Ofco® Okot OfcoD Okot Okoat 


rye Ownte — OBack Orr 


ox <> - 


‘Statistics: FiestLines¥olS_1daNotNts 


X | Stabstics: FirsthinesVolb_ChosedGameNot iad 


White - Rated games 238 Elo-© 2200 Performance =2234 


Total Games: 476 Total Games: 356 

1: 27 Games =113% Totak 54.6% 1-0: 16 Games = 9.0% Totat 53.1% 
Ye%, 207 Games = 87.0% Yer 157 Games = 8.2% 

0-1 4 Games =17% Totat 452% or 5 Games =28% Total 46.9% 


White - Rated games 178 G0-0 2200 Performance =2221 


Brack =. Rated games. 238 ~4Elo.O 2200 Performance: #2166 Black .Raledgames 178 Glo. 2200 Performance: #2179 


Nsd76 @fteute = Otength Ovens Okco-A Oko Olcot Okco-D Okcot Okcovt e356 


Player Olhae Ole @ooth 


@ke® = Olength Overs Oto Ofco8 OkcoC OfcoD Okot Oko-at 


oye | |S Oi Omek @eerh 
Figure 4. Win/draw/loss statistics for the six Volumes of the First Lines collection of games. 
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The numbers of “Total Games” displayed above include the selected lines of popular variations that have no 
game results recorded for them. The stats in Figure 4 confirm that our approach to selecting variations was 
sound, overall. Indeed, the white got 53.6% of points in Mega Database 2020, 53.9% in UltraCorr 2021, and 
53.9% in CCRL+ (with our Brainfish games added). In the next section, we will present and discuss more de- 
tailed statistical analysis of the material of Volume 1, Sicilian Defense. 
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ANALYSIS OF THE POPULAR VARIATIONS 


For the popular variations in Volume 1 of our First Lines collection (Sicilian Defense), we have retrieved nu- 
merous computer evaluations, primarily by Stockfish 12 and Stockfish 13, from ChessBase Engine Cloud, and 
filled the remaining few gaps with our local Stockfish 12 evals. We have added evaluations for 30 transposi- 
tions and 48 extra variations aimed at the students interested in in-depth research of the Sicilian. The depths of 
the engine analysis ranged from 32 to 82 plies, with the median value of 47 plies. We then used Excel to pro- 
duce a linear regression fit of the computer evals to the corresponding percentages of points scored by the white 
in the CCRL+ database. The resulting graph is shown in Figure 5. 


Linear Regression Fit of Evals to Percentages 


% points by the White 


Eval 


Figure 5. The linear regression fit of evals to CCRL+ percentages for Vol. 1 of First Lines. 


The most curious observation here is that the engines that played with the white pieces in CCRL+ games have 
managed to score so well in the variations given the zero eval (“even game’) by Stockfish 12 or Stockfish 13. 
Our follow-up analysis that seems to indicate that the computer evals for nearly even positions seemingly trend 
toward zero as the depth of computer search increases is illustrated in Figure 6. 


Evals vs. Depth 


Depth, plies 


Eval 


Figure 6. Evals vs. Depth. 


In other words, the chess engine sees fewer and fewer chances of beating itself starting with a nearly even posi- 
tion as its analysis gets deeper and deeper. This does not worsen its chances of beating a weaker engine starting 
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from the same position. It’s also important to realize that the computer eval of zero fails to distinguish a game 
with even chances from a “dead draw” that’s practically unavoidable. 


From a practical player’s standpoint, it appears that some of these high-depth zero evals are misleading, as they 
make their variations look less promising in comparison to some others that have not been analyzed to the same 
depth. Meanwhile, the assessments derived from the statistics of Monte Carlo tree search (CPW, 2021) may 
sometimes be proven unreliable by encounters with hard-to-find refutations that lurk like the proverbial “skele- 
tons in the closet”, ready to jump out and wreak havoc on the board. In other cases, evaluations may optimisti- 
cally reflect existence of a narrow path to an acceptable position that has to be navigated extremely carefully to 
avoid the many “landmines”. A player naturally gifted with an exceptionally good memory may still choose to 
learn and memorize the intricate details to be able to play the corresponding variation successfully against those 
less knowledgeable. 


We have also estimated that the main lines of Volume 1 were played in 86.1% of the Mega Database 2020 OTB 
Sicilians and 87.7% of the UltraCorr 2021 correspondence Sicilians. 


CONCLUSIONS AND FUTURE PLANS 


The First Lines collection is a novel and useful tool for showing chess students a map of the modern openings’ 
complicated landscape to help them pick openings and variations that suit their emerging individual styles for 
future in-depth study. The collection can then serve them as a good starting point for building their personal 
opening repertoires by concentrating on some of the openings and specific variations when playing the white 
pieces and preparing to play other openings and variations with the black pieces. Many unwanted openings and 
variations can be avoided, sometimes by cleverly selecting the right transposition of moves. We believe that it is 
to the students’ advantage to learn what works well first, along with the general principles of development in 
the opening, and only then study opening traps for fun at their leisure. At the same time, the First Line collec- 
tion helps us avoid the situation in which a trainer would naturally tend to push students toward the openings 
that the trainer knows best and prefers to play. Those openings may or may not fit the students’ styles — attack- 
ing, defensive, or balanced. 


The author plans to continue to refine and update the First Lines collection. Among possible future projects, the 

author considers the possibility of converting the First Lines collection into a modern book on openings for be- 

ginners and chess instructors, in an electronic and/or traditional paper format. 
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